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new spectral properties, for use in biological systems, especially those 
where fluorescence resonance energy transfer (FRET) is used to study the 
biological system. 

An example of a FRET-based method for studying biological systems is 
described in detail in UK Patent Application No 9817229.9 entitled 
"Fluorescent assay for biological systems" and the PCT application 
claiming priority from that application and which was filed on the same 
day as this application. A further example of a FRET-based method is 
described in Miyawaki et al (1997) Nature 388, 882-887. 

A first aspect of the invention provides a polypeptide comprising the 
functional portion of green fluorescent protein but wherein valine (V) 163 
has been replaced with alanine (A), serine (S) 175 has been replaced with 
glycine (G), isoleucine (I) 167 has been replaced with threonine (T), 
phenylalanine (F) 64 has been replaced with leucine (L), serine (S) 65 has 
been replaced with threonine (T), serine (S) 72 has been replaced with 
alanine (A), and threonine (T) 203 has been replaced with tyrosine (Y). 

By "functional portion" we include the meaning that it is the portion of the 
protein which, in the absence of other portions of GFP, gives rise to 
useful fluorescent properties, such as the portion being fluorescent. It will 
be appreciated that, in respect of this first aspect of the invention the GFP 
or polypeptide comprising the functional portion of GFP with the given 
mutations may also include other mutations which may confer further 
desirable properties. 
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A second aspect of the invention provides a polypeptide which has the 
amino acid sequence 

MGKGEELFTGVVPILVEIJXjDVNGQKFSVRGEGE 
GVQCFARYPDHMKR 

HDFLKSAMPEGYVQERTIFFKDDGNTKT 
SHNVYIMADKQKNG 

IKANFKTRHNIEDGGVQLADHYQQNTPIGD^ 
AGITHGMDELYKSF 

This is the sequence shown in Figure 3. 

Conveniently, the difference with respect to the native GFP can be 
summarised as S2G, H25Q, S30R, F64L, S65T, S72A, Q80R, F84L, 
V163A, I167T, S175G, T203Y, K209Q, N212H, E213G, A216S, + 
239S + 240F, making use of standard single letter amino acid code. 

By "green fluorescent protein", in the context of a single protein, we 
mean wild-type green fluorescent protein as described in Prasher et al 
(1992) Gene 111, 229-233 and whose amino acid sequence is given in 
Figure 1 . As noted above, the term GFP may be used to denote variants 
which in fact fluoresce yellow or blue. 

The sequence of a particular preferred polypeptide of the invention 
(variant GFP) is shown in Figure 3, and its sequence is compared with A. 
victoria GFP and mm GFP5 (Zernicka-Goeta et al (1997) Development 
124, 1133-1137 in Figures 4 and 5. 



It will be appreciated that the functional portion of the polypeptide which 
contains the mutations as said may be incorporated into any suitable 
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polypeptide in which it is desired to have a fluorescent moiety. Typically, 
the functional portion is included in a polypeptide whose fluorescence or 
change in fluorescence is measured under suitable conditions. Thus, the 
polypeptide may be one which is expressed as a reporter molecule (since 
its expression may be measured fluorimetrically). Alternatively, the 
functional portion may be included in a polypeptide which is used in a 
biological system which makes use of FRET. For example, a polypeptide 
of the invention includes a polypeptide which contains the fluorescent 
portion as said, and it is used in conjunction with another fluorescent 
moiety with which it acts as a donor or acceptor in a FRET reaction. 
Most suitably, the polypeptide of the invention contains, in addition to the 
functional portion as said, a further fluorescent moiety in the same 
polypeptide chain and the pair of fluorescent moieties may act as donor- 
acceptor pairs in a FRET reaction. Thus, the polypeptide of the invention 
typically is a fusion protein containing at least the functional portion of the 
polypeptide which contains the mutations as said. 

Thus, the polypeptide of the invention may be used in any suitable prior 
art FRET method. 



The polypeptides of the invention are particularly suited for use in the 
FRET method described in UK Patent Application No 9817229.9 entitled 
"Fluorescent assay for biological systems" and the PCT application 
claiming priority from that application and which was filed on the same 
day as this application since the mutations confer an unusually high 
fluorescent lifetime. The polypeptides of the invention are believed to be 
particularly suited as acceptor molecules since, at least in relation to the 
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molecule of the second aspect of the invention, it excites at 514nm and 
emits at 531nm. 

A particularly preferred polypeptide of the invention is one which has the 
amino acid sequence as shown in Figure 3. 

A further preferred polypeptide of the invention is one which comprises at 
least residues 7 to 229 of green fluorescent protein containing said amino 
acid replacements. The minimal domain required for fluorescence in GFP 
is believed to be amino acids 7 to 229 (Li et al (1997) /. BioL Chem. 272, 
28545-28549. Also, this information, and other information about GFPs, 
is available from Clontech Laboratories, Inc., 1020 East Meadow Circle, 
Palo Alto, CA 94303, USA (xqli@CLONTECH.com) . 

It is preferred that a polypeptide which comprises at least residues 7 to 
229 of green fluorescent protein not only contains the amino acid 
replacements described with respect to the first aspect of the invention, but 
also, compared to native A. victoria GFP, the amino acid replacements 
H25Q, S30R, Q80R, F84L, K209Q, N212H, F213G and A216S. 

A still further preferred polypeptide is one comprising a further 
fluorescent moiety. In particular, the further fluorescent moiety is one 
which is capable of FRET with the said portion of the variant GFP. 
Typically, this further fluorescent moiety is a GFP or a variant GFP. 
Thus, a particularly preferred polypeptide of the invention is a fusion 
polypeptide which contains at least the functional portion of the 
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polypeptide which contains the mutations as said, and contains a further 
mutant GFP. 

A third aspect of the invention provides a polynucleotide encoding a 
polypeptide of the first or second aspect of the invention. The 
polynucleotide may be DNA or RNA; DNA is preferred. A particularly 
preferred polynucleotide of the invention is shown in Figure 3 (DNA 
sequence) but, because of the degeneracy of the genetic code, it will be 
appreciated that other polynucleotides may encode the same polypeptide 
(ie with the amino acid sequence given in Figure 3). 

A fourth aspect of the invention provides an expression vector encoding a 
polypeptide of the first or second aspect of the invention. 

The expression vectors of the invention, and other polynucleotides can be 
constructed by standard laboratory molecular biology methods such as 
those described in Sambrook et al (1989) Molecular Cloning, A 
Laboratory Manual, Cold Spring Harbor Press, Cold Spring Harbor, New 
York incorporated herein by reference. 

The polynucleotide of the invention (typically DNA) may be expressed in a 
suitable host to produce a polypeptide comprising the polypeptide of the 
invention. Thus, the DNA encoding the polypeptide of the invention may 
be used in accordance with known techniques, appropriately modified in 
view of the teachings contained herein, to construct an expression vector, 
which is then used to transform an appropriate host cell for the expression 
and production of the polypeptide of the invention. Such techniques include 
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those disclosed in US Patent Nos. 4,440,859 issued 3 April 1984 to Rutter 
et al, 4,530,901 issued 23 July 1985 to Weissman, 4,582,800 issued 15 
April 1986 to Crowl, 4,677,063 issued 30 June 1987 to Mark et al y 
4,678,751 issued 7 July 1987 to Goeddel, 4,704,362 issued 3 November 
1987 to Itakura et al, 4,710,463 issued 1 December 1987 to Murray, 
4,757,006 issued 12 July 1988 to Toole, Jr. et al f 4,766,075 issued 23 
August 1988 to Goeddel et al and 4,810,648 issued 7 March 1989 to 
Stalker, all of which are incorporated herein by reference. 

The polynucleotide, such as DNA, encoding the polypeptide of the 
invention may be joined to a wide variety of other DNA sequences for 
introduction into an appropriate host. The companion DNA will depend 
upon the nature of the host, the manner of the introduction of the DNA into 
the host, and whether episomal maintenance or integration is desired. 

Generally, the DNA is inserted into an expression vector, such as a 
plasmid, in proper orientation and correct reading frame for expression. If 
necessary, the DNA may be linked to the appropriate transcriptional and 
translational regulatory control nucleotide sequences recognised by the 
desired host, although such controls are generally available in the expression 
vector. The vector is then introduced into the host through standard 
techniques. Generally, not all of the hosts will be transformed by the 
vector. Therefore, it will be necessary to select for transformed host cells. 
One selection technique involves incorporating into the expression vector a 
DNA sequence, with any necessary control elements, that codes for a 
selectable trait in the transformed cell, such as antibiotic resistance. 
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Alternatively, the gene for such selectable trait can be on another vector, 
which is used to co- transform the desired host cell. 

Host cells that have been transformed by the recombinant DNA of the 
invention are then cultured for a sufficient time and under appropriate 
conditions known to those skilled in the art in view of the teachings 
disclosed herein to permit the expression of the polypeptide, which can then 
be recovered. 

Many expression systems are known, including bacteria (for example E. 
coli and Bacillus subtilis), yeasts (for example Saccharomyces cerevisiae), 
filamentous fungi (for example Aspergillus), plant cells, animal cells and 
insect cells. 

The vectors include a prokaryotic replicon, such as the ColEl ori, for 
propagation in a prokaryote, even if the vector is to be used for expression 
in other, non-prokaryotic, cell types. The vectors can also include an 
appropriate promoter such as a prokaryotic promoter capable of directing 
the expression (transcription and translation) of the genes in a bacterial host 
cell, such as E. coli, transformed therewith. 

A promoter is an expression control element formed by a DNA sequence 
that permits binding of RNA polymerase and transcription to occur. 
Promoter sequences compatible with exemplary bacterial hosts are typically 
provided in plasmid vectors containing convenient restriction sites for 
insertion of a DNA segment of the present invention. 
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Typical prokaryotic vector plasmids are pUC18, pUC19, pBR322 and 
pBR329 available from Biorad Laboratories, (Richmond, CA, USA) and 
p7rc99A and pKK223-3 available from Pharmacia, Piscataway, NJ, USA. 

A typical mammalian cell vector plasmid is pSVL available from 
Pharmacia, Piscataway, NJ, USA. This vector uses the SV40 late promoter 
to drive expression of cloned genes, the highest level of expression being 
found in T antigen-producing cells, such as COS-1 cells. 

An example of an inducible mammalian expression vector is pMSG, also 
available from Pharmacia. This vector uses the glucocorticoid-inducible 
promoter of the mouse mammary tumour virus long terminal repeat to drive 
expression of the cloned gene. 

Useful yeast plasmid vectors are pRS403-4O6 and pRS413-416 and are 
generally available from Stratagene Cloning Systems, La Jolla, CA 92037, 
USA. Plasmids pRS403, pRS404, pRS405 and pRS406 are Yeast 
Integrating plasmids (Yips) and incorporate the yeast selectable markers 
HIS3, TRP1, LEU2 and URA3. Plasmids pRS413-416 are Yeast 
Centromere plasmids (YCps). 

A variety of methods have been developed to operably link DNA to vectors 
via complementary cohesive termini. For instance, complementary 
homopolymer tracts can be added to the DNA segment to be inserted into 
the vector DNA. The vector and DNA segment are then joined by 
hydrogen bonding between the complementary homopolymeric tails to form 
recombinant DNA molecules. 
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Synthetic linkers containing one _or more restriction sites provide an 
alternative method of joining the DNA segment to vectors. The DNA 
segment, generated by endonuclease restriction digestion, is treated with 
bacteriophage T4 DNA polymerase or E. coli DNA polymerase I, enzymes 
that remove protruding, 3' -single-stranded termini with their 3'-5'- 
exonucleolytic activities, and fill in recessed 3'-ends with their polymerizing 
activities. 

The combination of these activities therefore generates blunt-ended DNA 
segments. The blunt-ended segments are then incubated with a large molar 
excess of linker molecules in the presence of an enzyme that is able to 
catalyze the ligation of blunt-ended DNA molecules, such as bacteriophage 
T4 DNA ligase. Thus, the products of the reaction are DNA segments 
carrying polymeric linker sequences at their ends. These DNA segments 
are then cleaved with the appropriate restriction enzyme and ligated to an 
expression vector that has been cleaved with an enzyme that produces 
termini compatible with those of the DNA segment. 

Synthetic linkers containing a variety of restriction endonuclease sites are 
commercially available from a number of sources including International 
Biotechnologies Inc, New Haven, CN, USA. 

A desirable way to modify the DNA encoding the polypeptide of the 
invention is to use the polymerase chain reaction as disclosed by Saiki et al 
(1988) Science 239, 487-491. 
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In this method the DNA to be enzymatically amplified is flanked by two 
specific oligonucleotide primers which themselves become incorporated into 
the amplified DNA. The said specific primers may contain restriction 
endomiclease recognition sites which can be used for cloning into expression 
vectors using methods known in the art. 

The present invention also relates to a host cell transformed with a 
polynucleotide vector construct of the present invention. The host cell can 
be either prokaryotic or eukaryotic. Bacterial cells are preferred 
prokaryotic host cells and typically are a strain of E. coli such as, for 
example, the E. coli strains DH5 available from Bethesda Research 
Laboratories Inc., Bethesda, MD, USA, and RR1 available from the 
American Type Culture Collection (ATCC) of Rockville, MD, USA (No 
ATCC 3 1 343) . Preferred eukaryotic host cells include yeast and 
m a m malian cells, preferably vertebrate cells such as those from a mouse, 
rat, monkey or human fibroblastic cell line. Yeast host cells include 
YPH499, YPH500 and YPH501 which are generally available from 
Stratagene Cloning Systems, La Jolla, CA 92037, USA. Preferred 
mammalian host cells include Chinese hamster ovary (CHO) cells available 
from the ATCC as CCL61, NIH Swiss mouse embryo cells NIH/3T3 
available from the ATCC as CRL 1658, and monkey kidney-derived COS-1 
cells available from the ATCC as CRL 1650. 

Transformation of appropriate cell hosts with a DNA construct of the 
present invention is accomplished by well known methods that typically 
depend on the type of vector used. With regard to transformation of 
prokaryotic host cells, see, for example, Cohen et al (1972) Proc. NatL 
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Acad. ScL USA 69, 2110 and Sambrook et al (1989) Molecular Cloning, A 
Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, 
NY. Transformation of yeast cells is described in Sherman et al (1986) 
Methods In Yeast Genetics, A Laboratory Manual, Cold Spring Harbor, 
NY. The method of Beggs (1978) Nature 275, 104-109 is also useful. 
With regard to vertebrate cells, reagents useful in transfecting such cells, for 
example calcium phosphate and DEAE-dextran or liposome formulations, 
are available from Stratagene Cloning Systems, or Life Technologies Inc., 
Gaithersburg, MD 20877, USA. 

Electroporation is also useful for transforming cells and is well known in the 
art for transforming yeast cells, bacterial cells and vertebrate cells. 

For example, many bacterial species may be transformed by the methods 
described in Luchansky et al (1988) MoL MicrobioL 2, 637-646 
incorporated herein by reference. The greatest number of transformants is 
consistently recovered following electroporation of the DNA-cell mixture 
suspended in 2.5X PEB using 6250V per cm at 25\x¥D. 

Methods for transformation of yeast by electroporation are disclosed in 
Becker & Guarente (1990) Methods Enzymol. 194, 182. 

Successfully transformed cells, ie cells that contain a DNA construct of the 
present invention, can be identified by well known techniques. For 
example, cells resulting from the introduction of an expression construct of 
the present invention can be grown to produce the polypeptide of the 
invention. Cells can be harvested and lysed and their DNA content 
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examined for the presence of the DNA using a method such as that 
described by Southern (1975) 7. Mol BioL 98, 503 or Berent et al (1985) 
Biotech. 3, 208. Alternatively, the presence of the protein in the 
supernatant can be detected using antibodies. 

However, a convenient way of identifying transformed cells which express 
the polypeptide is that they are fluorescent. 

In addition to direcdy assaying for the presence of recombinant DNA, 
successful transformation can be confirmed by well known immunological 
methods when the recombinant DNA is capable of directing the expression 
of the protein. For example, cells successfully transformed with an 
expression vector produce proteins displaying appropriate antigenicity. 
Samples of cells suspected of being transformed are harvested and assayed 
for the protein using suitable antibodies. Of course, transformation and 
expression is indicated by the production of a fluorescent protein in this 
case. 

Thus, in addition to the transformed host cells themselves, the present 
invention also contemplates a culture of those cells, preferably a monoclonal 
(clonally homogeneous) culture, or a culture derived from a monoclonal 
culture, in a nutrient medium. 

A particularly suitable "starting n vector is the pcDNA3.1 vector 
distributed by Invitrogen (Invitrogen BV, De Schelp 12, 9351 NV Leek, 
The Netherlands). The key features of this vector for this invention are: (i) 
Cytomegalovirus enhancer-promoter for high level expression of the insert 
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in mammalian cells (the insert is for example the cDNA encoding the 
polypeptides described above and, has to be cloned into the vector); (ii) 
multiple cloning site in forward and reverse orientation; (iii) expression 
cassette for a selectable marker in eukaryotic cells (neomycin, zeocin or 
hygromycin). 

When the polypeptide of the invention is to be used ex vivo 9 such as in an 
in vitro assay or the like, it may be convenient to express the polypeptide 
in a bacterial system (such as E. coli), or in yeast or insect cells, or in 
other systems which have been designed for facile production of large 
amounts of protein. When the polypeptide of the invention is to be used 
in an in vivo assay it is conveniently expressed in the cell in which the 
assay is to be carried out, typically a mammalian cell. 

In a particularly preferred embodiment, DNA encoding the polypeptide of 
the invention (such as YFP5) may be fused to the promoter/enhancer 
elements of a gene under investigation. Such DNA stably introduced into 
mammalian cells may be used as a reporter for expression of the 
respective gene under investigation. Readout of the activity is the amount 
of polypeptide, such as YFP5, expressed, which can be determined by 
determination of the specific fluorescence of the polypeptide. Similar 
DNAs may be generated for the other GFPs such as those listed in Table 1 
(see below). Since they have overlapping spectra they cannot be used 
simultaneously in the same cells. However, using fluorescent lifetime 
imaging CFP, MmGFP5 and YFP5, for example, could be used 
simultaneously as their lifetimes are sufficiently separated from each 
other. Using multiple frequency FLIM (fluorescent lifetime imaging) the 
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relative amounts of these three GFP mutants expressed in the same cell 
could be determined with high precision and hence the promoter activity 
of at least three genes. 

Multifrequency FLIM is described in UK Patent Application No 
9817227.3 entided "Multiple Frequency Fluorescence Lifetime Imaging" 
and the PCT application claiming priority from that application and which 
has the same filing date as this application. 

The invention will now be described in more detail with reference to the 
following Figures and Example wherein: 

Figure 1 shows the cDNA and amino acid sequence of A victoria green 
fluorescent protein (GFP). 

Figure 2 shows the cDNA and amino acid sequence of a prior art mutant 
GFP (mmGFPS; Zernicka-Goetz et al). 

Figure 3 shows the cDNA and amino acid sequence of a polypeptide of 
the invention (called mmYFP or mYFP5 or YFP5) which is described in 
more detail in Example 1 . 

Figure 4 is a comparison of the cDNA sequences from Figures 1 to 3. 



Figure 5 is a comparison of the amino acid sequences from Figures 1 to 3. 
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Example 1: Construction of mutant GFP and its properties 

The mutant GFP which we call YFP5, which is a red-shifted mutant of 
MmGFP5, was generated by PCR-mediated site-directed mutagenesis of 
MmGFP5 (Zernicka-Goetz et al (1997) Development 124, 1133-1137). 
MmGFP5 is a wtGFP mutated in V163A, S175G, I167T, F64L and 
S65T; the mutations V163A, S175G and I167T were introduced into 
wtGFP by Siemering et al (1996) Current Biol. 6, 1653- , and Zernicka- 
Goetz et al introduced the mutations F64L and S65T). This approach 
introduced mutations S72A and T203Y into MmGFP5 using primer pairs 

ATGCGGCCGCGAATTCGCCACCATGGGTAAAGGAGAAGAACTT and 

CTGGGTATCTTGCGAAG(^TTGTACGTACAATGCTTCGCAAGATACCCAG; and 
GAAAGGGCAGATTGATAGGACAGGTAATGCATTACCTGTC CTATA ATCTGCCCTT 
TC and AAGGATCCTCTAGAAGCTTTTGTATAGTTCATCCATG. The 

underlined nucleotides indicate mismatches. 

The fluorescent lifetimes of various GFP mutants are shown in Table 1. 
References to Table 1 

1. Heim & Tsien (1996). Engineering green fluorescent protein for 
improved brightness, longer wavelengths and fluorescence resonance 
energy transfer. Curr. Bio. 6, 178-182. 



2. Orme M et al (1996). Crystal structure of the Aequorea victoria 
green fluorescent protein. Science 273, 1392-1395. 
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3. Zernicka-Goetz et al (1997). Following cell fate in the living 
mouse embryo. Development 124, 1 133-1 137. 

4. Miyawaki et al (1997). Fluorescent indicators for Ca 2+ based on 
green fluorescent proteins and calmodulin. Nature 388, 882-887. 

The final PCR product was gel-purified, digested with EcoKL and Xbal 
and subcloned into pEFT7MCS. This vector is based on pEF-BOS 
(Nucleic Acids Res. (1990) Sep 11; 18(17), 5322 pEF-BOS, a powerful 
mammalian expression vector. Mizushima S, Nagata S). A modified 
version of pEF-BOS containing a Neo resistance expression cassette, 
pEFl-Neo, was obtained from G. Baier, Innsbruck. The Neo expression 
cassette to make the vector smaller and introduced a T7 RNA polymerase 
promoter as well as several unique restriction enzyme sites downstream of 
the human EFla promoter and the SV40 polyadenylation site. 

Any other suitable vector, as described in the specification, may be used 
for the expression of the mutant GFP. The introduced mutations were 
verified by sequencing using Sequenase. The sequence of YFP5 is given 
in Figure 3. 

The respective GFP mutants were expressed in cells by microinjection 
(Pepperkok et al 9 1997 in "Microinjection and Transgenesis", eds. Cid- 
Arregui and Garcia-Carranca , Springer, Heidelberg , pp 1 45- 1 54) of 
plasmids based on the vector pEFT7MCS and with inserts of the 
respective GFP encoding cDNAs. At 2h after microinjection cells were 
mounted on the FLIM microscope sef-up and the respective lifetimes were 



W ° 00/08054 PCT/GB99/02596 

19 

determined at 37 °C in living cells. Any suitable expression system or 
lifetime-detection system may be used. 



YFP5 shows a well-separated and significantly longer lifetime than other 
GFP mutants making it an ideal partner in multi-labelling FLIM 
experiments. 
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CLAIMS 

1. A polypeptide comprising the functional portion of green 
fluorescent protein but wherein valine (V) 163 has been replaced with 
alanine (A), serine (S) 175 has been replaced with glycine (G), isoleucine 
(I) 167 has been replaced with threonine (T), phenylalanine (F) 64 has 
been replaced with leucine (L), serine (S) 65 has been replaced with 
threonine (T), serine (S) 72 has been replaced with alanine (A), and 
threonine (T) 203 has been replaced with tyrosine (Y). 

2. A polypeptide which has the amino acid sequence 

MGKGEELFTGWPILVEIJX3DVNGQKFSVPGEGEGDATYGKLTLKFICTTGKXPVPWPTLVTTLTY 
GVQCFARYPDHMKR 

HDFLKSAMPEGYVQERTIFFKDDGNYKTRAEVKFEGDTLVmiELKGIDFKEDGNILGHKLEYNYN 
SHNVYIMADKQKNG 

IKANFKTRHNIEDGGVQLADHYQQNTPIGDGPVIjLPDNHYLSYQSAIjSQDPHGKRDHMVLLEFVTS 
AG I THGMDELYKS F . 

3. A polypeptide according to Claim 1 which has the additional amino 
acid replacements S2G, H25Q, S30R, Q80R, F84L, N212H, E213G, 
A216S, and additional residues 239S and 240F. 

4. A polypeptide comprising at least residues 7 to 229 of green 
fluorescent protein containing the amino acid replacements as defined in 
any of Claims 1 to 3. 



WO 00/08054 



22 



PCT/GB99/02596 



5. A polypeptide according to any one of Claims 1, 3 and 4 
comprising a further fluorescent moiety. 

6. A polypeptide according to Claim 5 wherein the further fluorescent 
moiety is a green fluorescent protein or a variant thereof. 

7. A polypeptide according to any one of Claims 1, and 3 to 6 which 
is a fusion polypeptide. 

8. A polypeptide according to Claim 7 wherein the fusion polypeptide 
is one used in a biological system which makes use of FRET. 

9. A polynucleotide encoding a polypeptide according to any one of 
the preceding claims. 

10. An expression vector encoding a polypeptide according to any one 
of Claims 1 to 8. 

11. A host cell comprising a polynucleotide according to Claim 9 or an 
expression vector according to Claim 10. 

12. Use of a polypeptide according to any one of Claims 1 to 8 as a 
reporter molecule in a cell. 

13. Use of a polynucleotide according to Claim 9 or an expression 
vector according to Claim 8 to express a reporter molecule in a cell. 
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14. 



Any novel fluorescent protein as herein described. 
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Figure 1 

A. via or i a GFP: 
cDNA: 

atgagtaaaggagaagaacttctcactggagctgtcccaatccttgtcgaattagatggtgatgttaatgggcacaaatt 
ttctgtcagtggagagggtgaaggtgatgcaacatacggaaaacttacccttaaatttatttgcactactggaaaactac 
ctgttccatggccaacacttgtcactactttctcttatggtgttcaatgcttttcaagatacccagatcatatgaaacag 
catgactttttcaagagtgccatgcccgaaggttatgtacaggaaagaactatatttttcaaagatgacgggaactacaa 
gacacgtgctgaagtcaagtttgaaggtgatacccttgttaatagaatcgagttaaaaggtattgattttaaagaagatg 
gaaacattcttggacacaaattggaatacaactataactcacacaatgtatacatcatggcagacaaacaaaagaatgga 
atcaaagctaacttcaaaattagacacaacattgaagatggaagcgttcaactagcagaccattatcaacaaaatactcc 
aattggcgatggccctgtccttttaccagacaaccattacctgtccacacaatctgccctttcgaaagatcccaacgaaa 
agagagaccacatggtccttcttgagtttgtaacagctgctgggattacacatggcatggatgaactatacaaataa 

protein: 

MSKGEELFTGWPILVELDGDVNGHKFSVSGEGEGDATYGKLTLKFICTTGKLPVPWPT 

HDFFKSAMPEGYVQERTIFFKDDGNYKTRAEVKFEGDTLVNRI ELKGIDFKEIXSNII^HKLEYNYNSHNVYIMADKOKNG 
IKVNFKIRHNIEIXSSVQI^HYQQOTPIGDGPVL^ 
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Figure 2 



mmGFP5. 
CDNA: 

atgggtaaaggagaagaacccttcactggagctgtcccaattcttgttgaaccagacggtgatgttaatgggcaaaaacc 

ctctgtcaggggagagggtgaaggtgatgcaacatacggaaaacttacccttaaatctactcgcactactgggaagctac 

ctgttccctggccaacacttgtcactactttgacttatggtgtacaatgcttctcaagatacccagatcatatgaagcgg 

cacgacttcctcaagagcgccatgcctgagggatacgcgcaggagaggaccatcttcttcaaggacgacgggaactacaa 

gacacgtgctgaagtcaagtttgagggagacaccctcgtcaacaggatcgagcttaagggaatcgatttcaaggaggacg 

gaaacatcctcggccacaagttggaatacaactacaactcccacaacgtatacatcacggccgacaagcaaaagaacggc 

atcaaagccaactccaagacccgccacaacatcgaagacggcggcgtgcaactcgctgatcattatcaacaaaatactcc 

aattggcgatggccccgtccttttaccagacaaccattacctgtccacacaatctgcccctccccaagatccccacggaa 

agagagatcacatggcccctcctgagttcgttacacctgctgggattacacatggcatggacgaactatacaaaagcttc 
tag 



protein: 

MGKGEELFTGWPILVELDGDVNGQKFSVRGEGEGDATYGKLTLKFICTTGKLPVPWPTLVTTLTYGVOCFSRYPDHMKR 
HDFLKSAMPEGYVQERTIFFKDDGNYKTR^ 

IK^FKTRHNIEDGGVQI^HYCX?OTPIGDGPVU,PD^ 
Z 
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Figure 3 



mmYFP: 
cDNA: 

atgggtaaaggagaagaacctttcaccggagttgtcccaatccttgttgaattagatggtgacgctaatgggcaaaaatt 

ctctgtcaggggagagggtgaaggtgatgcaacatacggaaaacttacccttaaatttatttgcactactgggaagctac 

ctgttccctggccaacacttgtcactactttgacctatggtgtacaatgcttcGcaagatacccagatcatatgaagcgg 

cacgacttcctcaagagcgccacgcctgagggatacgtgcaggagaggaccatcttcttcaaggacgacgggaactacaa 

gacacgtgctgaagtcaagtttgagggagacaccctcgtcaacaggatcgagcttaagggaatcgatttcaaggaggacg 

gaaacatcctcggccacaagttggaatacaactacaactcccacaacgtatacatcatggccgacaagcaaaagaacggc 

atcaaagccaacttcaagacccgccacaacatcgaagacggcggcgtgcaactcgctgatcattatcaacaaaatactcc 

aattggcgatggccctgtccttttaccagacaaccattacctgtccTATcaatctgccctttcccaagatccccacggaa 

agagagatcacatggtccttcttgagtttgttacatctgctgggattacacatggcatggatgaactatacaaaagcttc 
tag 



protein: 

MGKGEELFTGWPILVEUX^VNGQKFSVRGEGEGDATYGKLTLKFICT^ 
HDFLKSAMPEGYVQERTI FFKDDGNYICTRAEVKFEGDTLVNRIELK^ 
IKANFKTRHNIEDGGVQIJUDHYWhrrPIGDGPVLLPDbWyLS 
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Nucleic acid alignment: 



Figure 4 (page 1 of 3) 

Nucleic acid alignment: 

- - ■ -10 . . . .20 . . . .30 . . . .40 . . . .50 60 

A. vict .GFP 1 : ATG a GT AAA GG AG AAGAACTTTTCACTGG AG TTGTCCCAATTCTTGTT : 4 8 

MMGFP5 1 : GAATTCGCCACCATGGGTAAAGGAGAAGAACTTTTCACT . 6 Q 

MMYFPS 1 : GAATTCGCCACCATGGGTAAAGGAGAAGAACTTTTCACT^^ : 6 0 

consensus 1:***********M!!M!M!! ! ■!!!!!!!!■!!*!!!* ••'•••ttiiiiiiiiiit* , ~ 

... -70 ... .80 ... .90 . 100 . . . 110 . . . 120 

A. vict - GFP 4 9 : GAATT AG ATGGTGATGTTAATGGGCAcAAATT t TCTGTCAG t GG AGAGGGTG AAGGTG AT z 1 0 8 
MMGFP5 6 1 : GAATTAGLA.TGGTGATGTTAATGGGCAAAAATTCTCT . 120 

MMYFP5 6 1 : GAATT AGATGGTGATGTTAA TGGGCAAPiAATTCTCTG T CAGGGG AG AGGGTG AAGGTGAT : 1 2 0 

consensus 6 1 : ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! * !!!!!!!!*!!!!!*! ' !!!!!!*!!!!!!!! i t I t i i t i i i X20 

. . . 130 . . . 140 . . . 150 . . . 160 . . . 170 . . . 180 
A. vict .GFP 109: GCAACATACGGAAAACTTACCCTTAAATTTATT^ ; 168 

MMGFP5 121: GCAACATACGGAAAACTTACCCTTAAATTTATTTGCACTAC^ : 1 8 0 

MMY FP5 121: GCAACATACGGAAAACTTACCCTTAAATTTATTTGCA : ! 8 0 

consensus 121 r!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!'*!!*!!!!;;!!!! 1 ** 180 

. . . 190 . . . 200 . . . 210 . . . 220 . . . 230 . . . 240 
A. vict. GFP 169 : TGGCCAACACTTGTCACTACTTTc t CTTATGGTGT t CAATGCTT t T CAAG AT AC CCAG AT : 2 2 8 
MMGFP5 181: TGG CCAACACTTGTCACTACrTTGACTT ATGGTGTACAATG CTT CTCAAG ATACCCAG AT : 2 4 0 

MMYFP5 181: TGGCCAACACTTGTCACTACTTTG ACTTATGGTGTACAATG CTT Cg CAAGATACCCAG AT : 2 4 0 

Consensus 181:l!!!!!l!!!!!!!!ll!!!!!! , **!!!!!!!!!!*!!!*!!!***!!!!!!!t!!ii!! : 240 

. . . 250 . . . 260 . . . 270 . . . 280 . . . 290 . . . 300 
A. vict . GFP 229 : CATATGAAa Ca GCA t G ACTT 1 1 TCAAG AG t GCC ATGCCcG Aa GG t TA t GTa CAGGAaAG a : 2 88 
MMGFPS 241: CATATGAAGCGGCACGACTTCCTCAAGAGCGCCATGCCTGAGGGATACGTGCAGGAGAGG : 3 0 0 

MMYFP5 241: CATATGAAGCGGCACGACTTCCTCAAGAGCGCCATGCCTGAGGGATACGTGCAGGAGAGG : 3 00 

consensus 241:!!!!'!!!*!* !!!*!!!!!**!!!!!■!*!!!!!!!!*!!*!!*! I*!!*!!!!!*!!*: 300 

- - - 310 . . . 320 . . . 330 . . . 340 . . . 350 . . . 360 
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Figure 4 (page 2 of 3) 

A . vie t . GFP 2 8 9: ACt ATaTTt TTCAAaGAtGACGGGAACTACAAGACACGTGCTGAAGTCAAGTTTGAaGG t : 3 4 8 

MMGFP5 3 01: ACCATCTTCTTCAAGGACGACGGGAACTACAAGACACGTG : 360 

MMYFP5 3 01: ACCATCrTC ITCAAGGACGACGGGAACTACAAGACACGTGCTGAAGTCAAGTTTGAGGGA : 36 0 

Consensus 301:!!M!M!MM!!M!*!!M!!!M!M!!!!!!!!!!!!!!!!H!!!!!!M*!!**360 



. . . 370 . . . 3 80 . . . 3 90 . . . 4 00 . . . 410 . . . 420 
A . vict . GFP 349: G A tACCCT t GT t AA t AGa ATCG AG t Ta AAa GG t AT t G ATTTt AAa G Aa G A t GG AAA CAT t : 4 0 8 
MMGFP5 3 61: GACACCCTCGTCAACAGGATCXjAGCTTAAGGGAATCGATTTCAAGGAGGA :420 

MMYFP5 361: GACACCCTCGTCAACAGGATCGAGCTTAAGGGAATCGATTTCA^ :42 0 

consensus 361:!!*!!!!!*!!*!!*!!*!!!!!!*!*!!*!!*!!*!!!!!*!!*!!*!!*!!!!l!!!*:420 



. . . 430 . . . 440 . . . 450 . . . 460 . . . 470 . . . 480 
A.vict.GFP 409: CT t GGa CACAAa TTGGAATACAACTA t AACTCa CACAA t GT AT ACATCATGGCaGACAAa : 4 6 8 
MMGFP5 4 21: CTCGGCCACAAGTTGGAATACAACTACAACTCCCACAACGTATA(^ : 4 8 0 

MMYFP5 4 21: CTCGGCCACAAGTTGGAATACAACTACAACTCCCACT^ : 4 8 0 

consensus 421:!!*!!*!!!!!*!!!!!!!!!!!!!!*!!!!!*!!!!I*!!!I!!!!!! ! !!!*! !! ! ! * : 4 8 0 



. . . 490 . . . S00 . . . 510 . . . 520 . . . 530 . . . 540 
A. vie t .GFP 4 6 9: CAAAAG AA t GG a ATCAAAG 1 1 AACTTCAAaA 1 1 aGa CACAACAT t G AAG A t GGa a GCGT t : 528 
MMGFP5 4 81: CAAAAG AA CGG CATCAAAGCCAACTTCAAG A CCCGCCACAA CAT CG AAG ACGGCGG CGTG : 5 4 0 

MMYFP5 4 81: CAAAAGAACGGCATCAAAGCCAACTTCAAGACCCGCCAC : 5 4 0 

consensus 481:!!!!!!!!*! !*!!!!!!!**!!!!!! ! !*!***!*!!! !! ! !!*!!!! ! * ! !**! ! ! !*: 54 0 



. . . 550 . . . 560 . . . 570 . . . 580 . . . 590 . . . 600 
A . vie t . GFP 52 9: CAACTaGCa GAc CATTATCAACAAAATACTCCAATTGGCXj ATGGCC^ : 5 8 8 

MMGFP5 54 1 : CAACTCGCTGATCATTATCAAGAAAATACT : 60 0 

MMYFP5 541: CAACTCGCTGATCATTATCAACAAAATACTCCAATT^ : 6 0 0 

consensus 54 1 :!!!!!*!!*■!*!!!!!<!!!!!! I I!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! ! :600 



. . . 610 . . . 620 . . . 630 . . . 640 . . . 650 . . . 660 
A.vict. GFP 589: GACAACCATTACCTGTCCACACAATCTGCCCTTTC^ : 6 4 8 

MMGFPS 6 01: GACAACCATTACCTGTCCACACAATCTGCCCTTTCCCAAGATCCCCACGGAAAGAGAGAT : 6 6 0 

MMYFPS 601 :GACAACCATTACCTGTCCtatCAATCTGCCCTTTCCCAAGATCCCCACGGAAAGAGAGAT: 660 

consensus 601:!!!!!!!!!!!!!!!!!!***!!!!!!!!!!!!!!**!!!!!!!!*!!!*!!!!!!!!!*:660 
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Figure 4 (page 3 of 3) 



670 . . . 680 . . . 690 . . . 700 . . . 710 . . 7 20 

708 



A.vict .GFP 6 4 9: CACATGGTCCTTCTTG AGTTTGTa ACAg CTGCTGOG ATT ACACATGGCATGG ATG AACTA 



MMGFP5 6 61: CACATGCTCCTTCTTGAGTTTGTTACATCTGCTG^ ; J2Q 

MMYFP5 6 61: CACATCXH^CTTCTTGAGT^ : 72Q 

720 



consensus 66 1 :!!!!!! M * MM M ! M ! MM * ! !! *•!»!!!•!!!!!! M ' 



. 730 . 



A. vict .GFP 


709 


:TACAAAtaa 


717 


MMGFP5 


721 


: TACAAAAGCTTCTAGA 


736 


MMYFP5 


721 


TACAAAAGCTTCTAGA : 


736 


consensus 


721 




736 
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Figure 5 

protein alignment : 



A.vict. 1 :MsKGEELFTGWPI LVELTCDWGhKFSVsGEGEGDATYGKLTLKFICTTTGfCLPVPWPTL : 60 

mmGFPS 1 : MGKGEELFTGVVPILVEIX>GDVNGQKFSVRGEGEGDATYGK^ : 60 

mmYFPS 1 iMGKGEELFTGWPILVELDGDVNGQKFSVRGEGEGDATYGKXTLKFICTTGKLPVPWPTL : 60 

consensus l:!*!!!!!!!!!!!!!!!!!!!!!!^!!!^!!!!!!!!!!!!!!!!!!!!!!;^!,,,. 6Q 



- - - -70 ... .80 ... .90 ... 100 .. . 110 . . . 120 
A.vict. 61 :VTTf sYGVQCFSRYPDHMKqHDFf KSAMPEGYVQERTIFFKDDGNYKTRAE^FEGDTLV : 120 

mmGFPS 61 : VTTLTYGVQCFSRYPDHMKRHDFLKS^ 120 

mmYFPS 61 : VTTLTYGVQCFaRYPDHMKRHDFLKSAMPEGYVQERTI FFKDIX3NYCTRAEVKFEGDTLV : 120 

consensus ^ 1 :!!!**!•!> M * ! I !!! I I*!!!*!!! ! M ! !! 1 !!•!!!!!!!!! !! I !!!!? ! !!!!! : 120 



. - . 130 . . . 140 . . . 150 . . . 160 . . . 170 . . . 180 
A.vict. 121: NRIELKGIDFKEDGNI LGHKLEYKYNSHNVYIMADKQKNGI KvNFKiRHNI EDGsVQLAD : 1 8 0 
mmGFPS 121: NRI ELKGIDFKEDGNILGHKLEYNYNSHNVYIMADKQKNGI KANFKTRHNI EDGGVQLAD : 1 8 0 

mmYFPS 121 :1TOIELKGIDFKEDGNILGHKLEYNYNSHNVYIMADKQK^ : iso 

consensus 121:! U! !!!!!! n Ml!!!!!!!!!!!!!!!!!! !!! ! !)(! !*; | !*) i i ; t t «*!! !! I -180 



• • - 190 . . . 200 . . . 210 . . . 220 . . . 230 . . . 240 
A.vict . 181 : HYQQNTPI GIX3PVl»LPD^YLST0SAI»SkDPneKRDHMVIJjEFVTaAG I THGMDELYKz . :238 
mmGFPS 181 : HYQQNTP I GDG PVX.LPDNH YI*STQSAI>SODPHGKRDHMVtiLEFVTS AG I THGMDEL* YKS F : 24 0 

mmYFPS 181: HYQQNTP I GIX3PVljIiPDNHYl*SyQS ALSQDPHGKRDHMVLIjEFVTS AG I THGMDELYKS F : 24 0 

consensus 181 :!!!!!!*» ! •!!!!! 1! !!!!!*! 1 ! i !*!!*♦!!!!! 1 I !!!! ! * !• i i !! I !!*!!** : 240 
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<110> Bastiaens, Philippe 
Pepperkok, Rainer 
Geley, Stefan 

Imperial College Research Technology Limited 

<120> Fluorescent protein 

<130> IMPWP21223 

<140> 
<141> 

<150> GB 9817225.7 
<151> 1998-08-08 

<160> 6 

<170> Patentln Ver. 2.0 

<210> 1 
<211> 722 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Mutant GFP 
<400> 1 

atgggtaaag gagaagaact tttcactgga gttgtcccaa ttcttgttga attagatggt 60 
gatgttaatg ggcaaaaatt ctctgtcagg ggagagggtg aaggtgatgc aacatacgga 120 
aaacttaccc ttaaatttat ttgcactact gggaagctac tgttccctgg ccaacacttg 180 
tcactacttt gacttatggt gtacaatgct tcgcaagata cccagatcat atgaagcggc 240 
acgacttcct caagagcgcc atgcctgagg gatacgtgca ggagaggacc atcttcttca 300 
aggacgacgg gaactacaag acacgtgctg aagtcaagtt tgagggagac accctcgtca 360 
acaggatcga gcttaaggga atcgatttca aggaggacgg aaacatcctc ggccacaagt 420 
tggaatacaa ctacaactcc cacaacgtat acatcatggc cgacaagcaa aagaacggca 480 
tcaaagccaa cttcaagacc cgccacaaca tcgaagacgg cggcgtgcaa ctcgctgatc 540 
attatcaaca aaatactcca attggcgatg gccctgtcct tttaccagac aaccattacc 600 
tgtcctatca atctgccctt tcccaagatc cccacggaaa gagagatcac atggtccttc 660 
ttgagtttgt tacatctgct gggattacac atggcatgga tgaactatac aaaagcttct 720 
ag 722 

<210> 2 
<211> 240 
<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence : Mutant GFP 
<400> 2 

Met Gly Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu VaJ. 
1 5 10 15 

Glu Leu Asp Gly Asp Val Asn Gly Gin Lys Phe Ser Val Arg Gly Glu 
20 25 30 

Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie Cys 
35 40 45 

Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu 
50 55 60 

Thr Tyr Gly Val Gin Cys Phe Ala Arg Tyr Pro Asp His Met Lys Arg 
65 70 75 80 

His Asp Phe Leu Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg 
85 90 95 

Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 
100 105 110 

Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly lie 
115 120 125 

Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr Asn 
130 135 140 

Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn Gly 
145 150 155 160 

lie Lys Ala Asn Phe Lys Thr Arg His Asn lie Glu Asp Gly Gly Val 
165 170 175 

Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly Pro 
180 185 190 

Val Leu Leu Pro Asp Asn His Tyr Leu Ser Tyr Gin Ser Ala Leu Ser 
195 200 205 



Gin Asp Pro His Gly Lys Arg Asp His Met Val Leu Leu Glu Phe Val 
210 215 220 



Thr Ser Ala Gly lie Thr His Gly Met Asp Glu Leu Tyr Lys Ser Phe 
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<210> 3 
<211> 43 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 3 

atgcggccgc gaattcgcca ccatgggtaa aggagaagaa ctt 43 

<210> 4 
<211> 49 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; PCR primer 
<400> 4 

ctgggtatct tgcgaagcat tgtacgtaca atgcttcgca agatacccag 50 

<210> 5 
<211> 57 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 5 

gaaagggcag attgatagga caggtaatgc attacctgtc ctataatctg ccctttc 57 

<210> 6 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 



<400> 6 
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aaaaggatcc tctagaagct tttgtatagt tcatccatg 39 
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